Exploitation of structural sparsity in algorithmic differentiation
نویسنده
چکیده
The background of this thesis is algorithmic differentiation (AD) [GW08] of in practice very computationally expensive vector functions F : R ⊇ D → R given as computer programs. Traditionally, most AD software1 provide forward and reverse modes of AD for calculating the Jacobian matrix ∇F (x) accurately at a given point x on some kind of internal representation of F kept on memory or hard disk. In fact, the storage is known to be the bottleneck of AD to handle larger problems efficiently in reverse mode. For instance, a tape is the internal representation of choice in the C++ operator overloading tool ADOL-C [GJM99] that presents an augmented version of F. Thus, ∇F can be obtained in forward and reverse fashion by an interpretative forward and reverse propagation of directional derivatives and adjoints [NMRC07] through the tape, respectively. The forward mode AD can be implemented very cheaply in terms of memory by single forward propagation of directional derivatives at runtime (tapeless in ADOL-C terminology). However, the reverse mode needs to store some data [HNP05] in the so-called forward sweep to allow the data flow reversal [Nau08] needed for backward propagation of adjoints. The latter is recently the focus of ongoing research activities of the AD community for m = 1 as a single application of reverse mode is enough to accumulate the gradient of F. To handle the memory bottleneck, checkpointing schedules e.g. revolve [GW00] have been developed for time-dependent problems. However, they require user’s knowledge in both the function F as well as the reverse mode AD. In this context, we aim to provide a tool, which minimizes non-AD experts effort in application of the reverse mode AD on their problems for large dimensions. Chapter 2 of this thesis is concerned with the accumulation of the Jacobian of F by the application of elimination techniques, which are very close to the Gaussian elimination performed in sparse LU factorization [PT08, FTPR04]. Thereby, we present algorithms that allow the application of elimination techniques [GN02] to the very large and sparse extended Jacobian of F being a lower triangular matrix of local partial derivative. However, the extended Jacobian is of quadratic memory complexity. Hence, compressed row storage [DER86] (CRS) representation is used to exploit its sparsity. This is done by first performing the so-called symbolic elimination step on the corresponding bit pattern of the extended Jacobian. This step predicts storage required for the statically allocated target CRS, which is used to accumulate the Jacobian of F at x. Nonetheless, the capability of the static CRS is also bounded by the memory consumption of the respective bit pattern even though the memory usage of CRS is considerably lower. To tackle this problem, elimination techniques are applied locally to the dense extended Jacobian (i.e without exploiting sparsity) and its CRS representation. Therefore, we keep track of the memory usage during the evaluation of F and apply elimination techniques whenever the memory bound is reached. The elimination is supposed to free memory enabling us to continue evaluating F. In fact, the evaluation of ∇F may require multiple evaluation and elimination steps. The former is supposed to provide the target data structure on which the latter is performed. We refer to this approach as iterative Jacobian accumulation. The implementations of the ideas above are provided in the C++ operator overloading tool DALG2 1Existing AD tools can be found on the community website www.autodiff.org. 2DALG stands for Derivative Accumulation for Large Graphs.
منابع مشابه
Exploiting Hierarchy for Ranking-based Recommendation
The purpose of this master's thesis is to study and develop a new algorithmic framework for collaborative filtering (CF) to generate recommendations. The method we propose is based on the exploitation of the hierarchical structure of the item space and intuitively"stands"on the property of Near Complete Decomposability (NCD) which is inherent in the structure of the majority of hierarchical sys...
متن کاملA NOVEL FUZZY-BASED SIMILARITY MEASURE FOR COLLABORATIVE FILTERING TO ALLEVIATE THE SPARSITY PROBLEM
Memory-based collaborative filtering is the most popular approach to build recommender systems. Despite its success in many applications, it still suffers from several major limitations, including data sparsity. Sparse data affect the quality of the user similarity measurement and consequently the quality of the recommender system. In this paper, we propose a novel user similarity measure based...
متن کاملNutrient compositional differentiation in the muscle of wild, inshore and offshore cage-cultured large yellow croaker (Pseudosciaena crocea)
The proximate composition, amino acids and fatty acids composition in the muscle of wild, inshore and offshore cage-cultured large yellow croaker, Pseudosciaena crocea (Richardson, 1846), were determined to identify nutritional differences. Wild fish groups showed highest content of moisture and crude protein, but the lowest lipid content. Offshore cage-cultured fish showed significantly higher...
متن کاملNew Acyclic and Star Coloring Algorithms with Application to Computing Hessians
Acyclic and star coloring problems are specialized vertex coloring problems that arise in the efficient computation of Hessians using automatic differentiation or finite differencing, when both sparsity and symmetry are exploited. We present an algorithmic paradigm for finding heuristic solutions for these two NP-hard problems. The underlying common technique is the exploitation of the structur...
متن کاملA new quadratic programming strategy for efficient sparsity exploitation in SQP-based nonlinear MPC and MHE ?
A large class of algorithms for nonlinear model predictive control (MPC) and moving horizon estimation (MHE) is based on sequential quadratic programming and thus requires the solution of a sparse structured quadratic program (QP) at each sampling time. We propose a novel algorithm based on a dual two-level approach involving a nonsmooth version of Newton’s method that aims at combining sparsit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011